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(54) Dynamic cache replication in a network through reverse tree generation 



(57) The distribution of replica caches of information 
is dynamically optimized among the regions 
(20,22,24,26,28,29) of a network, based upon the 
changing usage patterns. Initially, a single server (40) 
in a first region stores the primary copy of information. 
A router in the first region receives service request mes- 
sages from the plurality of users throughout network, di- 
rected to the server storing the primary copy To deter- 
mine if it is justified to deploy a replica cache, the router 
transmits a monitor request message toother routers in 
other regions of the network, requesting them to monitor 
all service requests in their respective regions, being di- 



rected to the primary copy in the first server. The first 
router then computes an optimum assignment of a rep- 
lica copy of the information, for storage in at least one 
newly assigned server in the network. The first router 
transmits a replication message to the newly assigned 
server, to enable it to store the replica copy of the infor- 
mation and provide service to user terminals in a newly 
assigned set of regions of the network. The first router 
also transmits an assignment message to the other rout- 
ers in the network, to enable them to redirect traffic from 
their respective regions to either the newly assigned 
server or to the first server. 
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Description 

Technical Field 

This invention relates to telecommunications, and 
more particularly to a method and apparatus for optimiz- 
ing the distribution of replica caches of information 
among diverse regions of a network. 

Background 

There is an inherent redundancy in message traffic 
on the internet, that generates an excessive load on the 
backbone network. The caching of certain applications 
and the provision of web proxies on the internet, eases 
some of the unnecessary replication in data transfer. 
The practice of multicasting is another technique to im- 
prove the efficiency of distribution. In multicasting, a 
sender distributes the same document, data, or mes- 
sage to a large number of receivers who are usually dis- 
persed over a wide area. 

To improve efficiency, it is beneficial to adopt a 
scheme in which a copy of the multicast document is 
delivered in each region to a predetermined server with 
the local distribution delegated to the server. This repli- 
cated cache mechanism may need an efficient tree 
building algorithm. Several examples of this are already 
used in internet protocols for static distributions of rep- 
lica caches. 

There are significant problems that are not solved 
by existing practices in the replication of caches on the 
internet. The main problem is the dynamic nature of in- 
ternet traffic, both over time and location. There are lo- 
cations which are active at certain times of the day and 
at other times they are inactive. What is needed is a dy- 
namic technique for assigning and reassigning servers 
in the network that will perform the role of replicated 
caches. 

Summary 

In one application of the invention, a network in- 
cludes a plurality of regions, with each region including 
a router, a server, and a plurality of user terminals. The 
router in each region couples the server and the plurality 
of user terminals to the network. In accordance with the 
invention, the distribution of replica caches of informa- 
tion is dynamically optimized among the regions of the 
network, based upon the changing usage patterns. This 
operation of the invention is referred to herein as reverse 
tree generation. Initially, a single server in a first region 
stores the primary copy of information. A router in the 
first region receives service request messages from the 
plurality of users throughout network, directed to the 
server storing the primary copy. 

A processor, which can be in the router or associ- 
ated with the router, transmits a monitor request mes- 
sage to other routers in other regions of the network, 
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requesting them to monitor all service requests in their 
respective regions, which are directed to the primary 
copy in the first server. In response to this, the processor 
receives monitor response messages from the other 
s routers, providing a count of service requests being di- 
rected to the first server. 

The processor then computes an optimum assign- 
ment of a replica copy of the information, for storage in 
at least one newly assigned server in the network. The 
10 processor transmits a replication message to the newly 
assigned server, to enable it to store the replica copy of 
the information and provide service to user terminals in 
a newly assigned set of regions of the network. The 
processor also transmits an assignment message to the 
is other routers in the network, to enable them to redirect 
traffic from their respective regions to either the newly 
assigned server or to the first server. 

The discussion in this Summary and the following 
Brief Description of the Drawings, Detailed Description, 
and drawings only deal with examples of this invention 
and are not to be considered in any way a limitation on 
the scope of the exclusionary rights conferred by a pat- 
ent which may issue from this application. The scope of 
such exclusionary rights is set forth in the claims at the 
end of this application. 

Brief Description of the Drawings 

Figures 1 A to 1 E show a succession of states of an 
example network organizing itself into a primary cache 
and two replica caches for three service domains. 

Figure 1 F shows the result of the network of Fig. 1 E 
periodically reorganizing itself into a primary cache and 
revised replica caches with revised service domains. 

Figs. 2A to 2C show message formats for router 
control for the example network of Figs. 1 A to 1 E. 

Fig. 2D shows an assignment message and Fig. 2E 
shows a replication message for the example network 
of Fig. 1 E, 

Fig. 3 shows an architectural diagram of the router 
30 in Fig. 1 A. 

Fig. 4 shows a flow diagram of the optimum replica 
cache assignment program. 

Fig. 5 shows a first stage organization of a second 
example network. 

Fig. 6 shows a second stage organization of the ex- 
ample network of Fig. 5. 

Fig. 7 A and 7B show router tables for the network 
organizations of Figs. 5 and 6, respectively. 

Fig. 8A shows an assignment message and Fig. 8B 
shows a replication message for the second stage or- 
ganization of the example network of Fig. 6. 

Detailed Description of the Preferred Embodiments 

Figs. 1A to 1E show a succession of stages of an 
example network as it automatically organizes itself to 
optimize the distribution of replica caches of information 
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among its regions. This operation of the invention is re- 
ferred to herein as reverse tree generation. The exam- 
ple network shown is the internet network in the United 
States, that is built around the internet backbone net- 
work 10. Fig. 1 A shows a first stage in the organization 
of the network when the server 40 in the northeast re- 
gion 20 stores the sole copy of a data set that is ac- 
cessed by users in all the regions of the network. The 
copy of the data set in server 40 is the primary informa- 
tion cache "P". This data set may represent, for exam- 
ple, the information for a particular internet website. Dur- 
ing periods of stable operation, service request messag- 
es are received at router 30 in the northeast region of 
the network, from users in all of the regions of the net- 
work. Those service requests are to access the primary 
cache "P" copy of the data set stored in the server 40. 
If this data set represents a popular internet website, for 
example the New York Times website "www.nytimes. 
com", then router 30 will be kept very busy handling 
these service requests. The invention operates to re- 
duce the load of service requests on router 30, by auto- 
matically distributing one or more replicas of the data 
set to new servers in remote regions of the network and 
assigning new routing tables to the routers in the net- 
work. The new routing tables redirect service requests 
originating from users in remote regions, to the new 
servers which share the load of service requests with 
the original server 40. 

The network of Fig. 1A includes a plurality of re- 
gions: the northeast region 20, the north central region 
22, the northwest region 24, the southeast region 26, 
the south central region 28, and the southwest region 
29. Each of the respective regions includes a router 30, 
32, 34, 36, 38, and 39 connected to the backbone net- 
work 10 by the respective path 30*, 32', 34', 36', 38', and 
39'. Each of the respective regions includes a server 40, 
42, 44, 46, 48, and 49 connected to its respective router 
by the respective path 40', 42', 44', 46*, 48\ and 49'. 
Each of the respective regions includes a plurality of us- 
er terminals 50, 52, 54, 56, 58, and 59 connected to their 
respective router by the respective path 50', 52', 54', 56', 
58', and 59'. In each of the regions, the respective router 
couples the respective server and the respective plural- 
ity of user terminals to the internet backbone network 
10, by means of the links 30', 32', 34', 36', 38', and 39'. 
Fig. 2D shows the existing router table 334 for the orig- 
inal organization of the network in Fig. 1A, where the 
server 40 in the northeast region 20 stores the sole copy 
of a data set as primary cache "P" that is accessed by 
users in all the regions of the network. Router table 334 
is stored at each router 30, 32, 34, 36, 38, and 39 in the 
network. 

Fig. 1 B shows the next stage of the network organ- 
izing itself for optimizing the distribution of replica cach- 
es of information among the regions. Router 30, or a 
processor associated with router 30, transmits a monitor 
request message to all of the other routers 32, 34, 36, 
38, and 39 connected to the backbone network 1 0 re- 
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questing the other routers to monitor service requests 
originating in their regions, that are directed to the pri- 
mary cache in server 40. Fig. 2A shows an example of 
a monitor request message 200 that is sent out by Rout- 

s er 30 to all of the other routers 32, 34, 36, 38, and 39 
connected to the backbone network 1 0. The monitor re- 
quest message 200 includes a field 202 specifying the 
destination router, a field 204 specifying the source rout- 
er, and a field 206 specifying the monitor request. The 

10 monitor request message 200 is broadcast to all other 
routers 32, 34, 36, 38, and 39 in the network. 

Fig. 3 is an architectural diagram of the router 30. 
All of the other routers 32, 34, 36, 38, and 39 in the net- 
work have a similar architecture. Router 30 includes a 

is memory 302 connected by the system bus 304 to the 
network adapter 308, the disk drive storage 306, the 
processor 310, and the server interface 312. The net- 
work adapter 308 connects the router 30 over path 30' 
to the backbone network 10. The server interface 312 

20 connects the router 30 over path 40' to the server 40. 
The interface 31 2 can also connect the router 30 over 
path 50' to the users 50 in Fig. 1 A. 

The memory 302 in Fig. 3 stores the service request 
message buffer 322 that receives service request mes- 

25 sages from the backbone network 1 0 and from the local 
users 50. Service request messages are passed on to 
the server 40 for processing. The memory 302 in Fig. 3 
stores the service response message buffer 324 that re- 
ceives service response messages from the server 50. 

30 Service response messages are forwarded to the back- 
bone network 10 or to the local users 50. 

The memory 302 in Fig. 3 stores the monitor re- 
quest message buffer 326 that stores the monitor re- 
quest message 200 of Fig. 2A. In router 30, which orig- 

35 inates the message 200, the message is built in buffer 
326 and then transmitted by the network adapter 308 to 
the other routers 32, 34, 36, 38, and 39. This stage is 
depicted in Fig. 1B, where router 30 broadcasts a re- 
quest for traffic monitoring to the other routers. In routers 

40 32, 34, 36, 38, and 39 which receive the message 200 
over their respective network adapters 308, the mes- 
sage 200 is buffered and read. The monitor request 
message 200 identifies the data set stored as the pri- 
mary cache P P" in the server 40. The monitor request 

45 message 200 asks the destination routers 32, 34, 36, 
38, and 39 to monitor all messages that they handle and 
to count the number of service requests directed to serv- 
er 40 that request access to the data set stored as the 
primary cache "P". The monitor request message 200 

50 asks the destination routers 32, 34, 36, 38, and 39 to 
respond to router 30 with a traffic value that is the accu- 
mulated count of such service requests over a given pe- 
riod of time. 

The memory 302 in Fig. 3 stores the traffic monitor- 
55 ing program 329. It runs under the operating system pro- 
gram 320, with its instructions being executed on the 
processor 310. In routers 32, 34, 36, 38, and 39 which 
receive the monitor request message 200 over their re- 
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spective network adapters 308, the traffic monitoring 
program 329 reads the message 200 buffered in buffer 
326 and performs the traffic monitoring operation shown 
in Fig. 1C. In each of the destination routers 32, 34, 36, 
38, and 39, the traffic monitoring program 329 monitors 
all messages that the router handles and counts the 
number of service requests directed to server 40 that 
request access to the data set stored as the primary 
cache "P". The traffic monitoring program 329 in the 
originating router 30 also performs this monitoring op- 
eration for the users 50 in the router's own region 20. 
The traffic monitoring program 329 in each of the routers 
30, 32, 34, 36, 38, and 39 responds to router 30 with a 
traffic value that is the accumulated count of such serv- 
ice requests over a given period of time. The memory 
302 in Fig. 3 stores the monitor response message buff- 
er 328 that stores the monitor response message 220. 
The monitor response message 220 includes afield 222 
specifying the destination router, a field 224 specifying 
the source router, and a field 226 specifying the monitor 
response which is the traffic value measured at that rout- 
er. The monitor response message 220 is then transmit- 
ted by each router 32, 34, 36, 38, and 39 back to the 
originating router 30. 

The memory 302 in Fig. 3 stores the optimum rep- 
lica cache assignment program 330. It runs under the 
operating system program 320, with its instructions be- 
ing executed on the processor 310. In the originating 
router 30 of Fig. 1D, the assignment program 330 com- 
putes the optimum distribution of replicas of the data set 
stored as the primary cache "P". This is based on the 
traffic values in each remote region 22, 24, 26, 28, and 
29 reported by each router 32, 34, 36, 38, and 39, and 
also as locally measured in region 20 by router 30. The 
optimum replica cache assignment program 330 is de- 
scribed in the flow diagram of Fig. 4. 

The optimum replica cache assignment program 
330 is based on an algorithm that performs a cost/ben- 
efit analysis of adding or deleting replica caches. The 
cost/benefit analysis algorithm will be described in 
greater detail below, in connection with an example of 
its application to a second network shown in Figs. 5 and 
6. The assignment program 330 periodically updates 
this cost/benefit analysis based on updated traffic mon- 
itoring measurements from the remote regions of the 
network. The cost/benefit analysis algorithm determines 
if a critical threshold quantity of traffic has accumulated 
in a region that justifies setting up a replica cache on a 
new server. The cost/benefit analysis algorithm uses a 
minimum weight spanning tree analysis or a Steiner tree 
analysis to make that determination. The assignment 
program 330 then computes new routing tables for the 
routers in the network, to redirect some of the service 
requests to the replica cache on the new server. The 
memory 302 in Fig. 3 stores the existing router table 334 
and the new router table 336. An example of the existing 
router table 334 is shown in Fig. 2D and an example of 
the new router table 336 is shown in Fig. 2E. 
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The memory 302 in Fig. 3 stores the assignment 
message buffer 332 where the router 30 builds an as- 
signment message 210, as shown in Fig. 2C. The as- 
signment message 210 includes a field 212 specifying 

s the destination router, a field 214 specifying the source 
router, and a field 216 specifying the new routing assign- 
ment computed by the cost/benefit analysis algorithm. 
Fig. 2F shows an example of the assignment message 
210. The field 212 specifies a broadcast to destination 

10 routers 32, 34, 36, 38, and 39. The field 214 specified 
the source router as router 30. And the assignment field 
216 specifies an example of the new router table 336 
shown in Fig. 2E. The assignment message 210 is 
broadcast to all other routers 32, 34, 36, 38, and 39 in 

'5 the network. It is also stored for use in the originating 
router 30. A replication message 802 shown in Fig. 2G, 
is sent from the router 30 to the router 34, for example, 
to provide a copy of the data set as replica cache "A" to 
the new server 44 in region 24. The field 212 specifies 

20 a destination router 34. The field 214 specifies the 
source router as router 30. A corresponding replication 
message is sent from the router 30 to the router 38, for 
example, to provide a copy of the data set as replica 
cache n B D to the new server 48 in region 28. 

25 Fig. 1 E shows the result of the reverse tree gener- 
ation operation of the invention on the network of Fig. 
1A, after the reassignment message 210 has been 
broadcast by router 30 to all other routers 32, 34, 36, 
38, and 39 in the network. Fig. 2E shows the new router 

30 table for the updated organization of the network in Fig. 
1 E, where two additional replica caches have been cre- 
ated in the network. Replica cache "A" is a new copy of 
the data set stored in new server 44 in region 24. Router 
34 handles all service requests to server 44. The service 

35 domain 60A for server 44 is the northwest region 24 and 
the southwest region 29. Router 34 in region 24 and 
router 39 in region 29 are controlled by the new router 
table 336 of Fig. 2E to redirect to the replica cache "A" 
on server 44, all of the service requests to access the 

40 data set that originate in regions 24 and 29. Replica 
cache "B" is a new copy of the data set stored in new 
server 48 in region 28. Router 38 handles all service re- 
quests to server 48. The service domain 60 B for server 
48 is the north central region 22 and the south central 

45 region 28. Router 32 in region 22 and router 38 in region 
28 are controlled by the new router table 336 of Fig. 2E 
to redirect to the replica cache "B" on server 48, all of 
the service requests to access the data set that originate 
in regions 22 and 28. Primary cache u P n is the existing 

so copy of the data set stored in original server 40 in region 
20. Router 30 handles all service requests to server 40. 
The service domain 60 for server 40 is the northeast re- 
gion 20 and the southeast region 26. Router 30 in region 
20 and router 36 in region 26 are controlled by the new 

55 router table 336 of Fig. 2E to redirect to the primary 
cache "P" on server 40, all of the service requests to 
access the data set that originate in regions 20 and 26. 
Fig. 1 F shows the result of the network periodically 
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reorganizing itself into a primary cache and revised rep- 
lica caches with revised service domains. In Fig. 1E, 
router 30 periodically transmits a monitor request mes- 
sage to all of the other routers 32, 34, 36, 38, and 39 
connected to the backbone network 10 requesting the 
other routers to monitor service requests originating in 
their regions, accessing the data set in either replica 
cache "A 0 , replica cache "B", or primary cache "P". The 
routers 32, 34, 36, 38, and 39 receive the monitor re- 
quest message 200 over their respective network adapt- 
ers 308. The traffic monitoring program 329 reads the 
message 200 buffered in buffer 326 and performs the 
traffic monitoring operation. In each of the destination 
routers 32, 34, 36, 38, and 39, the traffic monitoring pro- 
gram 329 monitors all messages that the router handles 
and counts the number of service requests directed to 
server 40, or 44, or 48 that request access to the data 
set stored as the primary cache M P", the replica cache 
"A", or the replica cache "B", respectively. The originat- 
ing router 30 computes the optimum distribution of rep- 
licas of the data set stored as the primary cache "P". 
This is based on the traffic values in each remote region 
22, 24, 26, 28, and 29 reported by each router 32, 34, 
36, 38, and 39, and also as locally measured in region 
20 by router 30. Fig. 1F shows the network of Fig. 1E, 
after the reassignment message 210 has been broad- 
cast by router 30 to all other routers 32, 34, 36, 38, and 
39 in the network. Replica cache "A" remains the same 
copy of the data set stored in server 44 in region 24. 
Router 34 handles all service requests to server 44. The 
service domain 60 A for server 44 has changed to be- 
come the northwest region 24 and the north central re- 
gion 22. Router 34 in region 24 and router 32 in region 
22 are controlled by a new router table to redirect to the 
replica cache "A" on server 44, all of the service re- 
quests to access the data set that originate in regions 
24 and 22. Replica cache M B" has been torn down. Rep- 
lica cache "C° is a new copy of the data set stored in 
new server 49 in region 29. Router 39 handles all service 
requests to server 49. The service domain 60C for serv- 
er 49 is the southwest region 292 and the south central 
region 28. Router 39 in region 29 and router 38 in region 

28 are controlled by the new router table to redirect to 
the replica cache "C" on server 49, all of the service re- 
quests to access the data set that originate in regions 

29 and 28. Primary cache "P" is the existing copy of the 
data set stored in original server 40 in region 20. Router 

30 handles all service requests to server 40. The service 
domain 60 for server 40 is the northeast region 20 and 
the southeast region 26. Router 30 in region 20 and rout- 
er 36 in region 26 are controlled by the new router table 
to redirect to the primary cache "P" on server 40, all of 
the service requests to access the data set that originate 
in regions 20 and 26. In this manner the network peri- 
odically reorganizes itseff into a primary cache and re- 
vised replica caches with revised service domains to ac- 
commodate dynamically changing traffic patterns. 

The optimum replica cache assignment program 
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330 is described in the flow diagram of Fig. 4. The as- 
signment program 330 uses a cost/benefit analysis al- 
gorithm 415 that is a type of minimum weight spanning 
tree analysis. The links connecting all of the router 

5 nodes in the network form an undirected graph. Since 
the links connect all of the routers in the network, it forms 
a tree that spans the graph, thus the name "spanning 
tree". Each of the links is weighted with a cost value re- 
lated to the number of requests per unit time "T". The 

10 value "T" is the number of requests passing over the link 
for a data set of a given size that is stored in a target 
cache on a target server. The value of "T" is the mes- 
sage traffic number of requests per unit time of the link. 
The sum of the number of requests per unit times for all 

15 of the links is related to the bandwidth cost C[BW]. The 
total cost C[T] of the existing distribution of caches, in- 
cludes the bandwidth cost C[BW] as a factor. The pur- 
pose of the optimum replica cache assignment program 
330 is to find a distribution of replica caches storing the 

20 data set, that minimizes the total cost C[T]. Thus, the 
cost/benefit analysis algorithm 41 5 is a type of minimum 
weight spanning tree analysis. 

Step 402 begins by calculating the total cost C[T] of 
the existing distribution of caches. In order to evaluate 

25 the impact of the adding replicated server, a cost func- 
tion is defined that takes into account the resources uti- 
lized and the various costs. This includes the opportu- 
nity cost in case a single server (or a series of servers) 
is overloaded and the potential customers drop out be- 
so fore being served. 

Let C[T] denote the total cost. Define C[BW] as the 
bandwidth cost, C[STO] as the storage cost at the base 
location, C[REP] as the replication cost in a single serv- 
er, C[OC] as the opportunity cost, C[UP] is the update 

55 cost associated with updating the replicated copies to 
keep them from being stale. 

Hence, C[T] = C[BW] + C[STO] + C[REP] + C[OC] + C 
[UP]. 

The bandwidth cost C[BW] is a function of the 

40 number of requests per unit times used on the individual 
links and is the product of a proportionality constant at- 
tributed to the number of requests per unit time, times 
the sum of the number of requests per unit times on each 
link. The calculation of C[BW] can be illustrated with the 

45 example network shown in Fig. 5. 

Fig. 5 shows a second network with eight routers a, 
b, c, d, e, f, g, and h connecting eight user local access 
points 1 , 2, 3, 4, 5, 6, 7, and 8 so that they can access 
a data set whose sole copy is stored as the primary 

50 cache "P" on server 40. Server 40 is connected by router 
"a" to the other routers b, c, d, e, f, g, and h in the net- 
work. Router "a" is currently handling 1000 requests per 
hour for a ten-megabyte file stored in primary cache "P" 
as part of the data set on server 40. In accordance with 

55 the invention, reverse tree generation is used to dynam- 
ically reorganize the caches in the network in response 
to usage patterns. Router "a" periodically broadcasts a 
monitoring request to the other routers b, c, d, e, f, g, 
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and h to begin monitoring each of the network links that 
it handles for a period, e.g. of one hour. Fig. 7A shows 
the existing router table 334 for the network of Fig. 5. 

During the monitoring period, each user local ac- 
cess point makes a quantity of T - requests for the ten- 
megabyte file stored in primary cache "P p as part of the 
data set on server 40. The value of "T" is the message 
traffic number of requests per unit time. User local ac- 
cess point 1 is monitored by its servicing router n c" to 
have a message traffic number of requests per unit time 
to server 40 of T = 1 25. User local access point 2 is mon- 
itored by its servicing router "c" to have a message traffic 
number of requests per unit time to server 40 of T = 50. 
User local access point 3 is monitored by its servicing 
router "d" to have a message traffic number of requests 
per unit time to server 40 of T r: 250. User local access 
point 4 is monitored by its servicing router n e" to have a 
message traffic number of requests per unit time to serv- 
er 40 of T = 225. User local access point 5 is monitored 
by its servicing router T to have a message traffic 
number of requests per unit time to server 40 of T = 50. 
User local access point 6 is monitored by its servicing 
router "g" to have a message traffic number of requests 
per unit time to server 40 of T = 50. User local access 
point 7 is monitored by its servicing router "a" to have a 
message traffic number of requests per unit time to serv- 
er 40 of T = 50. User local access point 8 is monitored 
by its servicing router °b M to have a message traffic 
number of requests per unit time to server 40 of T = 200. 

During the monitoring period, each network link is 
monitored to pass a quantity of "T" requests for the ten- 
megabyte file stored in primary cache "P" as part of the 
data set on server 40. The eleven network links inter- 
connecting the routers of Fig. 5 are monitored as fol- 
lows. Network link "a-b" connecting routers "a" and "b" 
has a message traffic number of requests per unit time 
to server 40 of T = 850. Network link "b-c" connecting 
routers "b u and "c" has a message traffic number of re- 
quests per unit time to server 40 of T = 175. Network 
link "c-d" connecting routers "c" and "d" has a message 
traffic number of requests per unit time to server 40 of 
T = 0. Network link "d-e" connecting routers "d" and D e u 
has a message traffic number of requests per unit time 
to server 40 of T = 0. Network link "e-f" connecting rout- 
ers u e" and T has a message traffic number of requests 
per unit time to server 40 of T = 0. Network link "f-g" 
connecting routers T and "g" has a message traffic 
number of requests per unit time to server 40 of T = 50. 
Network link °g-a" connecting routers "g" and "a" has a 
message traffic number of requests per unit time to serv- 
er 40 of T = 100. Network link "d-h" connecting routers 
"d" and "h" has a message traffic number of requests 
per unit time to server 40 of T = 250. Network link "e-h" 
connecting routers "e° and "h" has a message traffic 
number of requests per unit time to server 40 of T = 225. 
Network link "f-h" connecting routers "f and "h" has a 
message traffic number of requests per unit time to serv- 
er 40 of T = 0. Network link "h-b" connecting routers "h p 



and V has a message traffic number of requests per 
unit time to server 40 of T = 475. 

The bandwidth cost C[BW] for the network of Fig. 5 
is calculated in step 402 of Fig. 4, to be the product of 

5 a proportionality constant attributed to the number of re- 
quests per unit time, times the sum of the number of 
requests per unit times on each link. For this example 
the number of requests per unit time cost per megabit 
is assumed to be 0.1. The sum of the values of T" for 

JO the eleven network links is 2125. Thus, the bandwidth 
cost C[BW] calculated in step 402 for the network of Fig. 
5 is 21 25 (which is 21 25 requests times 10 megabits per 
request times 0.1 cost per megabit.) 

Next, step 402 of Fig. 4 calculates C[STO], the stor- 
es age cost at the base location. For this example, this val- 
ue is 500. Next, step 402 of Fig. 4 calculates C[REP], 
the replication cost in a single server. Since there is no 
replication yet, this value is zero. Next, step 402 of Fig. 
4 calculates C[OC] the opportunity cost. Market re- 

20 search shows that the opportunity cost per lost hit is five. 
Empirical observation of the number of lost hits per hour 
is 1 00. Thus, the value of C[OC] for the opportunity cost 
is 500, for this example. Next, step 402 of Fig. 4 calcu- 
lates C[UP] the update cost associated with updating 

2s the replicated copies to keep them from being stale. 
Since there is no replication yet, this value is zero. Thus, 
step 402 of Fig. 4 calculates C[T] = C[BW] + C[STO] + 
C[REP] + C[OC] + C[UP] to have a value of 21 25 + 500 
+ 0 + 500 + 0 = 3125. 

30 Next, step 404 of Fig. 4 sorts the links that terminate 
at the destination cache or caches, from the highest link 
to lowest cost. For the example network of Fig. 5, the 
sorted order is as follows. 
Network link "a-b" T = 850. 

35 Network link "g-a" T = 1 00. 

Next, step 406 of Fig. 4 starts a loop. For each link 
V terminating at an existing cache, perform steps 408 
and 410. Step 408 starts a nested loop, for each source - 
"S" contributing to the link T, perform step 410. Step 

40 41 0 finds the depth (the number of segments along the 
path to the link) and the weight (e.g. , the bandwidth con- 
tribution) of the source. For the network of Fig. 5, step 
406 starts with the highest cost link which is network link 
"a-b" with T = 850. 

45 The sources for traffic to network link "a-b" are rout- 
ers c, d, e, and h and the user local access point 8. Step 
408 starts the nested loop, for each source contributing 
to the network link "a-b", perform step 410. For the con- 
tribution of router "e", Step 410 finds the depth (the 

50 number of segments along the path to the link) as three: 
segment e-h, segment h-b, and segment b-a. Then step 
410 multiplies the depth value of three times the number 
of requests per unit time from router "e" of 225, resulting 
in a contribution of 675 from router "e". If router "e° were 

55 eliminated from contributing to the network link "a-b", 
then a cost of 675 could be avoided. For example, server 
47 is connected to router "e" in Fig. 5. If server 47 were 
made a replica server for the data set in primary cache 
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■P", then a cost of 675 would be avoided. However, this 
cost avoidance would have to be balanced against the 
cost of replicating the data set in server 47. This balanc- 
ing is carried out in steps 414 and 416, as will be de- 
scribed later. 

For the contribution of router "d", Step 410 finds the 
depth (the number of segments along the path to the 
link a-b) as three: segment d-h, segment h-b, and seg- 
ment b-a. Then step 410 multiplies the depth value of 
three times the number of requests per unit time from 
router "d" of 250, resulting in a contribution of 750 from 
router "d". If router "d" were eliminated from contributing 
to the network link "a-b n , then a cost of 750 could be 
avoided. For example, server 49 is connected to router 
"d" in Fig. 5. If server 49 were made a replica server for 
the data set in primary cache "P M , then a cost of 750 
would be avoided. However, this cost avoidance would 
have to be balanced against the cost of replicating the 
data set in server 49. This balancing is carried out in 
steps 414 and 416, as will be described later. 

For the contribution of router V, Step 41 0 finds the 
depth (the number of segments along the path to the 
link) as two: segment c-b and segment b-a. Then step 
410 multiplies the depth value of two times the number 
of requests per unit time from router "c" of 1 75, resulting 
in a contribution of 350 from router "c B . If router "c" were 
eliminated from contributing to the network link "a-b n , 
then a cost of 350 could be avoided. For example, server 
48 is connected to router "c" in Fig. 5. If server 48 were 
made a replica server for the data set in primary cache 
"P n , then a cost of 350 would be avoided. However, this 
cost avoidance would have to be balanced against the 
cost of replicating the data set in server 48. This balanc- 
ing is carried out in steps 414 and 416, as will be de- 
scribed later. 

For the contribution of router "h", Step 410 finds the 
depth (the number of segments along the path to the 
link) as two: segment h-b and segment b-a. Then step 
410 multiplies the depth value of two times the number 
of requests per unit time from router "h" of 475, resulting 
in a contribution of 950 from router "h". If router "h" were 
eliminated from contributing to the network link "a-b n , 
then a cost of 950 could be avoided. For example, server 
44 is connected to router "h" in Fig. 5. If server 44 were 
made a replica server for the data set in primary cache 
"P", then a cost of 950 would be avoided. However, this 
cost avoidance would have to be balanced against the 
cost of replicating the data set in server 44. This balanc- 
ing is carried out in steps 414 and 416, as will be de- 
scribed later. 

For the contribution of user local access point 8, 
Step 41 0 finds the depth (the number of segments along 
the path to the link) as one: segment b-a. Then step 410 
multiplies the depth value of one times the number of 
requests per unit time from the user local access point 
of 200, resulting in a contribution of 200 from user local 
access point 8. If user local access point 8 were elimi- 
nated from contributing to the network link "a-b", then a 
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cost of 200 could be avoided. For example, server 46 is 
connected to router "b" in Fig. 5. If server 46 were made 
a replica server for the data set in primary cache "P", 
then a cost of 850 would be avoided. However, this cost 
s avoidance would have to be balanced against the cost 
of replicating the data set in server 46. This balancing 
is carried out in steps 414 and 416, as will be described 
later. 

For the network of Fig. 5, the loop beginning at step 
10 406 continues with the next highest cost link which is 
network link "g-a" with T = 100. 

Next, step 412 of Fig. 4 evaluates candidate server 
nodes based on the calculated depths and weights de- 
termined from steps 406, 408, and 410. In the example 
is of Fig. 5, there are six candidate servers on which can 
be stored a replica cache of the data set in primary 
cache *P". Server 42 is connected by router T to the 
network. Server 44 is connected by router "h" to the net- 
work. Server 46 is connected by router a b* to the net- 
20 work. Server 47 is connected by router "e" to the net- 
work. Server 48 is connected by router "c" to the net- 
work. And server 49 is connected by router "d" to the 
network. 

Step 412 uses the results of the loop performed by 
25 steps 406, 408, and 410, discussed above. If server 44 
were made a replica server for the data set in primary 
cache "P°, then a cost of 950 would be avoided. If server 
46 were made a replica server for the data set in primary 
cache "P", then a cost of 850 would be avoided. If server 
30 47 were made a replica server for the data set in primary 
cache B P B , then a cost of 675 would be avoided. If server 

48 were made a replica server for the data set in primary 
cache "P\ then a cost of 350 would be avoided. If server 

49 were made a replica server for the data set in primary 
35 cache "P°, then a cost of 750 would be avoided. Evalu- 
ating these results, step 412 determines that the best 
candidate for a replica cache is server 44 connected to 
router "h". 

Next, step 414 of Fig. 4 creates a trial, new network 

40 traffic pattern. It does this by assigning the primary 
cache traffic previously directed from those routers near 
the candidate, replica cache server, to be redirected 
from going to the primary cache server, to going to the 
candidate server. Then step 41 4 calculates the total cost 

45 C[T] of the trial, new network traffic pattern. 

Fig. 6 shows the network of Fig. 5, with the trial, new 
network traffic pattern directed to the candidate, replica 
cache "A" on server 44 and to the primary cache "P" on 
server 40. Step 414 assigns the primary cache traffic 

so previously directed from those routers d, e, and f that 
are near the candidate, replica cache server 44, to be 
redirected from going to the primary cache server 40, to 
going to the candidate server 44. Then step 41 4 calcu- 
lates the total cost C[T] of the trial, new network traffic 

55 pattern for the network of Fig. 6. 

The eleven network links interconnecting the rout- 
ers of the trial, new network traffic pattern for the network 
of Fig. 6, have the following traffic. Network link "a-b" 
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connecting routers "a" and "b" has a message traffic 
number of requests per unit time to server 40 of T = 375. 
Network link "b-c" connecting routers "b" and V has a 
message traffic number of requests per unit time to serv- 
er 40 of T = 175. Network link "c-d" connecting routers 
"c" and "d" has a message traffic number of requests 
per unit time of T = 0. Network link "d-e" connecting rout- 
ers "d" and "e" has a message traffic number of requests 
per unit time of T = 0. Network link "e-f connecting rout- 
ers "e" and 1" has a message traffic number of requests 
per unit time of T = 0. Network link "f-g" connecting rout- 
ers T and M g" has a message traffic number of requests 
per unit time of T = 0. Network link "g-a" connecting rout- 
ers °g B and "a" has a message traffic number of requests 
per unit time to server 40 of T = 50. Network link "d-h" 
connecting routers "d" and "h" has a message traffic 
number of requests per unit time to replica cache server 
44 of T = 250. Network link "e-h" connecting routers 'e D 
and "h" has a message traffic number of requests per 
unit time to replica cache server 44 of T = 225. Network 
link l-h" connecting routers T and "h" has a message 
traffic number of requests per unit time to replica cache 
server 44 of T = 50. Network link B h-b a connecting rout- 
ers "h" and "b" has a message traffic number of requests 
per unit time of T = 0. 

The bandwidth cost C[BW] for the network of Fig. 6 
is calculated in step 414 of Fig. 4, to be the product of 
a proportionality constant attributed to the number of re- 
quests per unit time, times the sum of the number of 
requests per unit times on each link. For this example 
the number of requests per unit time cost per megabit 
is assumed to be 0. 1 . The sum of the values of "T for 
the eleven network links is 1125. Thus, the bandwidth 
cost C[BW] calculated in step 41 4 for the network of Fig. 
6 is 1125. 

Next, step 414 of Fig. 4 calculates C[STO], the stor- 
age cost at the base location. For this example, this val- 
ue is 500. Next, step 414 of Fig. 4 calculates C[REP], 
the replication cost in the single replica cache server 44. 
In this example the replication cost per megabits (per 
day) is 50. For a ten megabit file, this amounts to a rep- 
lication cost C[REP] of 500. Next, step 41 4 of Fig. 4 cal- 
culates C[OC] the opportunity cost. Since the addition 
of the replica cache "A" is anticipated to meet all user 
requests for access to the data base, the value of C[OC] 
for the opportunity cost is zero, for this example. Next, 
step 414 of Fig. 4 calculates C[UP] the update cost as- 
sociated with updating the replicated copies to keep 
them from being stale. The update cost is 100, for this 
example. Thus, step 414 of Fig. 4 calculates C[T] = C 
[BW] + C[STO] + C[REP] + C[OC] + C[UP] to have a 
value of 1125 + 500 + 500 + 0 + 100 = 2225. 

Next, step 416 of Fig. 4 decides whether the trial, 
new network traffic pattern provides a lower total cost C 
[T] than does the existing distribution of caches. For the 
network of Fig. 6, step 41 6 determines the trial, new net- 
work traffic pattern provides a lower total cost C[T] of 
2225 than does the existing distribution of caches in Fig. 



5, which has a total cost C[T] of 3125. Thus, step 416 
decides that trial, new network traffic pattern of Fig. 6 
should be implemented and become the basis for a new 
router assignment for the routers of the network. 

5 Next, step 418 of Fig. 4 broadcasts a new assign- 
ment to all routers. For the example of Fig. 6, router "a" 
broadcasts the new router table 336 is shown in Fig. 7B 
to the routers b, c, d, e, f, g, and h. This illustrates the 
principle of group casting by the invention, where a sin- 

10 gle router table 336 is distributed to many network rout- 
ers, to selectively provide the identity of the destination 
cache, depending on the identity of the router perform- 
ing the routing function. The assignment message 21 0, 
is shown in Fig. 8A. The assignment message 210 in- 

is eludes a field 212 specifying the destination router, a 
field 214 specifying the source router, and a field 216 
specifying the new routing assignment computed by the 
cost/benefit analysis algorithm 415. A replication mes- 
sage 802 shown in Fig. SB, is sent from the router "a" 

20 to the router "h", for example, to provide a copy of the 
data set as replica cache "A" to the new server 44. The 
field 212 specifies a destination router "h". The field 214 
specifies the source router as router "a". 

Step 418 of Fig. 4 then loops back to step 402. The 

25 loop back to step 402 can be delayed at step 420 until 
new traffic monitoring data has been received by the 
router 30. The original existing router table 334 in Fig. 
7A is discarded. The new router table 336 in Fig. 7B be- 
comes the new existing router table. Thus, the optimum 

30 replica cache assignment program 330 of Fig. 4 period- 
ically reassesses the traffic distribution in the network, 
thereby adapting it to changing traffic patterns. 

The assignment program 330 also provides for de- 
leting replica caches from the network when they are no 

35 longer needed. Assume the traffic pattern changes from 
that shown in Fig. 6. User local access point 4 changes 
to a message traffic number of requests per unit time to 
replica cache "A" server 44 of T = 25. User local access 
point 3 changes to a message traffic number of requests 

40 per unit time to replica cache "A" server 44 of T = 50. 
Then step 412 calculates the tradeoff for deleting the 
replica cache "A" from server 44. Step 412 determines 
that the increase in the traffic on links h-b and b-a would 
be T = 150, if the replica cache "a" were deleted. Step 

45 412 also determines that the deletion of replica cache 
"A" would save the replication cost of 500. Step 41 2 rec- 
ognizes that there is a net gain of 350 if the replica cache 
were removed. Step 416 then decides to tear down the 
replica cache "A". Step 418 then broadcasts the new 

50 assignment to all routers in the network. 

In accordance with the invention, reverse tree gen- 
eration is used to dynamically reorganize the caches in 
the network in response to usage patterns. This solves 
the problem of the dynamic nature of internet traffic, both 

55 over time and location. There are locations which are 
active at certain times of the day and at other times can 
be considered very inactive. The invention overcomes 
this by making its decision to replicate caches dynami- 
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cally, based on usage patterns. The invention continu- 
ously updates "grow" decisions to add replica caches 
and "prune" decisions to delete them. This results a dy- 
namic tree with servers at its dynamically changing core 
locations. Effectively, a minimum weight spanning tree 
is being built 'on the fly*. To avoid the accumulation of 
stale data at the replica cache servers, each of these 
servers will be continuously (if necessary) updated by 
using a protocol such as server cache synchronization 
protocol. 

One aspect in the implementation of the invention 
is the way users access the replicated cache server 
closest to them. One embodiment is to let users first 
reach their destination point and find out the tree struc- 
ture for that application, and then establish connection 
to the node in the spanning tree that is most appropriate. 
An alternate embodiment is to treat each application as 
a multicast group, and users requesting to join the group 
be routed accordingly by Groupcast capable routers. 
Groupcast (GC) is conceptually similar to the multicast 
however works in the opposite direction. A GC router 
keeps track of an updated routing table, and under nor- 
mal conditions will act as a regular router. As a new min- 
imum weight spanning tree starts to be developed, the 
entries in the routing table are updated. 

The invention provides a way to change the distri- 
bution of replica caches, based on dynamic usage pat- 
terns. The invention accomplishes this by a reverse tree 
generation technique. The invention uses the opportu- 
nity cost of loosing unsuccessful customers, to dynam- 
ically optimize the distribution of replica caches. 

An additional feature of the monitoring process per- 
formed by the network routers, is the collection of server 
hit information for the underlying data set application. 
This is useful for market research by the owner of the 
data set in the cache. There are other features of the 
invention, such as regional applications that accommo- 
date the regional differentiation in the servers. For ex- 
ample, place regional editions of a newspaper, adver- 
tisement, promotions etc in the local servers. 

various illustrative examples of the invention have 
been described in detail. In addition, however, many 
modifications and changes can be made to these exam- 
ples without departing from the nature and spirit of the 
invention. 



Claims 

1. In a network including a plurality of regions, each 
region including a router, a server, and a plurality of 
user terminals, the router coupling the server and 
the plurality of user terminals to the network, a 
method for distributing replica caches of information 
among the regions, comprising the steps of: 

ascertaining with a processor in a first region, 
a measure of requests made by user terminals 
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in the network for said information; and 
responsive to said measure of requests, said 
processor transmitting a message to a router in 
at least one other region to store a copy of said 
s information as a replica cache. 

2. The method of claim 1 , further comprising the step 
of: 

10 responsive to said measure of requests, said 

processor in said first region transmitting a 
message to a router in at least one other region 
to redirect said requests made by user termi- 
nals to said replica cache. 

15 

3. The method of claim 1 , further comprising the step 
of: 

responsive to said measure of requests said 
20 processor in said first region selecting said 

router in at least one other region by means of 
a cost/benefit analysis. 

4. The method of claim 1 , further comprising the step 
25 of: 

responsive to said measure of requests said 
processor in said first region selecting said 
router in at least one other region by means of 
30 a cost/benefit analysis that calculates the total 

cost of a distribution of caches in the network. 

5. The method of claim 1 , further comprising the step 
of: 

35 

responsive to said measure of requests said 
processor in said first region selecting said 
router in at least one other region by means of 
a cost/benefit analysis that calculates a lost op- 
40 portunity cost of not having a replica cache. 

6. In a network including a plurality of regions, each 
region including a router, a server, and a plurality of 
user terminals, the router coupling the server and 

45 the plurality of user terminals to the network, a 
method for distributing cache replicas of information 
among the regions, comprising the steps of: 

transmitting from a first region a monitor re- 
so quest message to a plurality of routers in other 

regions, requesting said other routers to moni- 
tor requests directed to a first server in said first 
region, to deliver said information; 
receiving monitor response messages from 
55 said other routers; and 

responsive to said response messages, trans- 
mitting a message to at least one of said other 
routers to store a copy of said information as a 
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replica cache. 

7. The method of claim 6, further comprising the step 
of: 

5 

responsive to said response messages, trans- 
mitting a message to a router in at least one 
other region to redirect said requests to said 
first server, to said replica cache. 

10 

8. The method of claim 6, further comprising the step 
of: 

responsive to said response messages, select- 
ing said first server by means of a cost/benefit is 
analysis. 

9. The method of claim 6, further comprising the step 
of: 

20 

responsive to said response messages, select- 
ing said first server by means of a cost/benefit 
analysis that calculates the total cost of a dis- 
tribution of caches in the network. 

25 

10. The method of claim 6, further comprising the step 
of: 

responsive to said response messages, select- 
ing said first server by means of a cost/benefit 30 
analysis that calculates a lost opportunity cost 
of not having a replica cache. 

11. In a network including a plurality of regions, each 
region including a router, a server, and a plurality of 35 
user terminals, the router coupling the server and 
the plurality of user terminals to the network, a 
method for optimizing the distribution of replica 
caches of information among the regions, compris- 
ing the steps of: 40 
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of the network; and 

transmitting an assignment message from said 
first router to said another server, to enable said 
another server to store said replica copy of said 
information and provide service to user termi- 
nals in an assigned subplurality of regions of 
the network. 

12. The method of claim 11, further comprising the step 
of: 

said computing step including selecting said 
another server by means of a cost/benefit anal- 
ysis. 

1 3. The method of claim 1 1 , further comprising the step 
of: 

said computing step including selecting said 
another server by means of a cost/benefit anal- 
ysis that calculates the total cost of a distribu- 
tion of caches in the network. 

1 4. The method of claim 1 1 , further comprising the step 
of: 

said computing step including selecting said 
another server by means of a cost/benefit anal- 
ysis that calculates a lost opportunity cost of not 
having a replica cache. 

15. In a network including a plurality of regions, each 
region including a router, a server, and a plurality of 
user terminals, the router coupling the server and 
the plurality of user terminals to the network, a 
method for optimizing the distribution of replica 
caches of information among the regions, compris- 
ing the steps of: 

forwarding service request messages from a 
plurality of users in the network, to a first server 
in the first region storing a primary copy of in- 
formation; 

receiving at a second router in a second region 
of the network, a monitor request message 
from said first router requesting the second 
router to monitor service requests being direct- 
ed to said first server; 

monitoring at said second router, service re- 
quests being directed to said first server; 
transmitting a monitor response message from 
said second router to said first router, providing 
a count of service requests being directed to 
said first server; 

receiving an assignment message at said sec- 
ond router from said first router, including an 
optimum assignment of a replica copy of said 
information for storage at a second server in 



receiving service request messages at a first 
router in a first region of the network, from a 
plurality of users in the network, the service re- 
quests being directed to a first server in the first 45 
region storing a primary copy of information; 
transmitting a monitor request message from 
said first router to a second router in a second 
region of the network, requesting the second 
router to monitor service requests being direct- 50 
ed to said first server; 

receiving a monitor response message at said 
first router from said second router, providing a 
count of service requests being directed to said 
first server; 55 
computing at said first router, an optimum as- 
signment of a replica copy of said information 
for storage at another server in another region 
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said second region of the network; and 
enabling said second server to store said rep- 
lica copy of said information and provide serv- 
ice to user terminals in an assigned subplurality 
of regions of the network. s 

16. In a network including a plurality of regions, each 
region including a router, a server, and a plurality of 
user terminals, the router coupling the server and 
the plurality of user terminals to the network, a sys- 10 
tern for distributing replica caches of information 
among the regions, comprising: 

a router in a first region; 

a processor coupled to said router, for ascer- is 
taininga measure of requests made by user ter- 
minals in the network for said information; 
said router transmitting a message to a router 
in at least one other region to store a copy of 
said information as a replica cache, in response 20 
to said measure of requests. 

17. The system of claim 16, further comprising: 

responsive to said measure of requests, said 25 
router in said first region transmitting a mes- 
sage to a router in at least one other region to 
redirect said requests made by user terminals 
to said replica cache. 

30 

18. The system of claim 16 further comprising: 

responsive to said measure of requests said 
processor selecting said router in at least one 
other region by means of a cost/benefit analy- 35 
sis. 

19. The system of claim 16, further comprising: 

responsive to said measure of requests said 40 
processor selecting said router in at least one 
other region by means of a cost/benefit analysis 
that calculates the total cost of a distribution of 
caches in the network. 

46 

20. The system of claim 16, further comprising: 

responsive to said measure of requests said 
processor selecting said router in at least one 
other region by means of a cost/benefit analysis so 
that calculates a lost opportunity cost of not 
having a replica cache. 
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